SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Lau, Lester F. 
(ii) TITLE OF INVENTION: Extracellular Matrix Signalling Molecules 
(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, 0' Toole, Gerstein, Murray & Borun 

(B) STREET: 6300 Sears Tower, 233 South Wacker Drive 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Clough, David W. 

(B) REGISTRATION NUMBER: 36,107 

(C) REFERENCE/DOCKET NUMBER: 28758/33766 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312/474-6300 

(B) TELEFAX: 312/474-0448 

(C) TELEX: 25-3856 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 180.. 1316 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION: "Mouse cyr61 cDNA coding sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CGAGAGCGCC CCAGAGAAGC GCCTGCAATC TCTGCGCCTC CTCCGCCAGC ACCTCGAGAG 60 

AAGGACACCC GCCGCCTCGG CCCTCGCCTC ACCGCACTCC GGGCGCATTT GATCCCGCTG 120 

CTCGCCGGCT TGTTGGTTCT GTGTCGCCGC GCTCGCCCCG GTTCCTCCTG CGCGCCACA 17 9 

ATG AGC TCC AGC ACC TTC AGG ACG CTC GCT GTC GCC GTC ACC CTT CTC 227 

Met Ser Ser Ser Thr Phe Arg Thr Leu Ala Val Ala Val Thr Leu Leu 
1 5 10 15 

CAC TTG ACC AGA CTG GCG CTC TCC ACC TGC CCC GCC GCC TGC CAC TGC 275 

His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His Cys 
20 25 30 

H; CCT CTG GAG GCA CCC AAG TGC GCC CCG GGA GTC GGG TTG GTC CGG GAC 323 

O Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp 
O 35 40 4 5 

• 

yj GGC TGC GGC TGC TGT AAG GTC TGC GCT AAA CAA CTC AAC GAG GAC TGC 371 

SI Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Asn Glu Asp Cys 
fjj " 50 " 55 60 

f* AGC AAA ACT CAG CCC TGC GAC CAC ACC AAG GGG TTG GAA TGC AAT TTC 419 

!L Ser Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 

p 65 70 75 80 



i¥ GGC GCC AGC TCC ACC GCT CTG AAA GGG ATC TGC AGA GCT CAG TCA GAA 4 67 

fij Gly Ala Ser Ser Thr Ala Leu Lys Gly He Cys Arg Ala Gin Ser Glu 

p 85 90 95 

GGC AGA CCC TGT GAA TAT AAC TCC AGA ATC TAC CAA AAC GGG GAA AGC 515 

Gly Arg Pro Cys Glu Tyr Asn Ser Arg He Tyr Gin Asn Gly Glu Ser 
100 105 110 

TTC CAG CCC AAC TGT AAA CAC CAG TGC ACA TGT ATT GAT GGC GCC GTG 563 

Phe Gin Pro Asn Cys Lys His Gin Cys Thr Cys He Asp Gly Ala Val 
115 ~ 120 125 

GGC TGC ATT CCT CTG TGT CCC CAA GAA CTG TCT CTC CCC AAT CTG GGC 611 

Gly Cys He Pro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly 

130 135 140 

TGT CCC AAC CCC CGG CTG GTG AAA GTC AGC GGG CAG TGC TGT GAA GAG 659 

Cys Pro Asn Pro Arg Leu Val Lys Val Ser Gly Gin Cys Cys Glu Glu 

145 150 155 160 

TGG GTT TGT GAT GAA GAC AGC ATT AAG GAC TCC CTG GAC GAC CAG GAT 707 

Trp Val Cys Asp Glu Asp Ser He Lys Asp Ser Leu Asp Asp Gin Asp 
165 170 175 



GAC CTC CTC GGA CTC GAT GCC TCG GAG GTG GAG TTA ACG AGA AAC AAT 
Asp Leu Leu Gly Leu Asp Ala Ser Glu Val Glu Leu Thr Arg Asn Asn 
180 185 190 



755 



GAG TTA ATC GCA ATT GGA AAA GGC AGC TCA CTG AAG AGG CTT CCT GTC 803 

Glu Leu lie Ala lie Gly Lys Gly Ser Ser Leu Lys Arg Leu Pro Val 

195 200 205 

TTT GGC ACC GAA CCG CGA GTT CTT TTC AAC CCT CTG CAC GCC CAT GGC 851 

Phe Gly Thr Glu Pro Arg Val Leu Phe Asn Pro Leu His Ala His Gly 

210 215 220 

CAG AAA TGC ATC GTT CAG ACC ACG TCT TGG TCC CAG TGC TCC AAG AGC 8 99 

Gin Lys Cys lie Val Gin Thr Thr Ser Trp Ser Gin Cys Ser Lys Ser 

225 230 235 " 240 

TGC GGA ACT GGC ATC TCC ACA CGA GTT ACC AAT GAC AAC CCA GAG TGC 94 7 

Cys Gly Thr Gly lie Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys 

245 250 ' 255 

CGC CTG GTG AAA GAG ACC CGG ATC TGT GAA GTG CGT CCT TGT GGA CAA 995 

y ; Arg Leu Val Lys Glu Thr Arg lie Cys Glu Val Arg Pro Cys Gly Gin 

h 260 2 65 270 

^ CCA GTG TAC AGC AGC CTA AAA AAG GGC AAG AAA TGC AGC AAG ACC AAG 104 3 

f\ Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys Thr Lys 

W 275 280 285 

111 AAA TCC CCA GAA CCA GTC AGA TTT ACT TAT GCA GGA TGC TCC AGT GTC 1091 

bj Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Ser Ser Val 

I 290 295 300 

yf AAG AAA TAC CGG CCC AAA TAC TGC GGC TCC TGC GTA GAT GGC CGG TGC 1139 

Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys 

305 310 315 320 

TGC ACA CCT CTG CAG ACC AGA ACT GTG AAG ATG CGG TTC CGA TGC GAA 1187 

Cys Thr Pro Leu Gin Thr Arg Thr Val Lys Met Arg Phe Arg Cys Glu 

325 330 335 

GAT GGA GAG ATG TTT TCC AAG AAT GTC ATG ATG ATC CAG TCC TGC AAA 1235 

Asp Gly Glu Met Phe Ser Lys Asn Val Met Met lie Gin Ser Cys Lys 

340 345 350 

TGT AAC TAC AAC TGC CCG CAT CCC AAC GAG GCA TCG TTC CGA CTG TAC 1283 

Cys Asn Tyr Asn Cys Pro His Pro Asn Glu Ala Ser Phe Arg Leu Tyr 

355 360 365 

AGC CTA TTC AAT GAC ATC CAC AAG TTC AGG GAC TAAGTGCCTC CAGGGTTCCT 1336 

Ser Leu Phe Asn Asp lie His Lys Phe Arg Asp 
370 ~ 375 

AGTGTGGGCT GGACAGAGGA GAAGCGCAAG CATCATGGAG ACGTGGGTGG GCGGAGGATG 1396 

AATGGTGCCT TGCTCATTCT T GAG T AGC AT TAGGGTATTT CAAAACTGCC AAGGGGCTGA 1456 

TGTGGACGGA CAGCAGCGCA GCCG 14 80 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 37 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(D) OTHER INFORMATION: "Mouse Cyr61 amino acid sequence" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Ser Ser Thr Phe Arg Thr Leu Ala Val Ala Val Thr Leu Leu 
15 10 15 

His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His Cys 
20 25 30 

Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp 
35 40 45 

Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Asn Glu Asp Cys 
50 55 " 60 

Ser Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 
65 * 70 75 80 

Gly Ala Ser Ser Thr Ala Leu Lys Gly lie Cys Arg Ala Gin Ser Glu 
85 90 95 

Gly Arg Pro Cys Glu Tyr Asn Ser Arg lie Tyr Gin Asn Gly Glu Ser 
100 105 110 

Phe Gin Pro Asn Cys Lys His Gin Cys Thr Cys lie Asp Gly Ala Val 
115 120 125 

Gly Cys lie Pro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly 
130 ~ 135 140 

Cys Pro Asn Pro Arg Leu Val Lys Val Ser Gly Gin Cys Cys Glu Glu 
145 150 155 160 

Trp Val Cys Asp Glu Asp Ser lie Lys Asp Ser Leu Asp Asp Gin Asp 
165 170 175 

Asp Leu Leu Gly Leu Asp Ala Ser Glu Val Glu Leu Thr Arg Asn Asn 
180 " 185 190 

Glu Leu lie Ala lie Gly Lys Gly Ser Ser Leu Lys Arg Leu Pro Val 
195 200 205 

Phe Gly Thr Glu Pro Arg Val Leu Phe Asn Pro Leu His Ala His Gly 
210 215 220 



Gin Lys Cys lie Val Gin Thr Thr Ser Trp Ser Gin Cys Ser Lys Ser 
225 230 235 240 



Cys Gly Thr Gly He Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys 
245 250 255 

Arg Leu Val Lys Glu Thr Arg He Cys Glu Val Arg Pro Cys Gly Gin 
260 265 270 

Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys Thr Lys 
275 280 285 

Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Ser Ser Val 
290 295 300 

Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys 
305 1 ~ 310 315 320 

Cys Thr Pro Leu Gin Thr Arg Thr Val Lys Met Arg Phe Arg Cys Glu 
325 330 335 

EI Asp Gly Glu Met Phe Ser Lys Asn Val Met Met He Gin Ser Cys Lys 
g " 340 345 350 

^ Cys Asn Tyr Asn Cys Pro His Pro Asn Glu Ala Ser Phe Arg Leu Tyr 
W 355 360 365 

[fl Ser Leu Phe Asn Asp He His Lys Phe Arg Asp 
h I 370 375 

L (2) INFORMATION FOR SEQ ID NO: 3: 

Jf; (i) SEQUENCE CHARACTERISTICS: 
Mi (A) LENGTH: 1418 base pairs 

!U (B) TYPE: nucleic acid 

P (C) STRANDEDNESS: single 

f|| (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 124.. 1266 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION: "Human cyr61 cDNA coding sequence" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGGCGGGCCC ACCGCGACAC CGCGCCGCCA CCCCGACCCC GCTGCGCACG GCCTGTCCGC 60 

TGCACACCAG CTTGTTGGCG TCTTCGTCGC CGCGCTCGCC CCGGGCTACT CCTGCGCGCC 120 

ACA ATG AGC TCC CGC ATC GCC AGG GCG CTC GCC TTA GTC GTC ACC CTT 168 
Met Ser Ser Arg He Ala Arg Ala Leu Ala Leu Val Val Thr Leu 
15 10 15 



CTC CAC TTG ACC AGG CTG GCG CTC TCC ACC TGC CCC GCT GCC TGC CAC 
Leu His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His 



216 





TGC 


CCC 




Cys 


Pro 




GAC 


GGC 




Asp 


Gly 




TGC 


AGC 




Cys 


Ser 
65 




TTC 


GGC 




Phe 
80 


Gly 




GAG 


GGC 




Glu 


Gly 




AGT 


TTC 


y .1 


Ser 


Phe 








pi 


GTG 


GGC 




Val 


Gly 


: y 






5 


GGC 


TGT 




Gly 


Cys 
145 




GAG 


TGG 




Glu 
160 


Trr> 




GAC 


GGC 




Asp 


Gly 




TTG 


ACG 




Leu 


Thr 




AAG 


CGG 




Lys 


Arg 




TTA 


CAA 




Leu 


Gin 

225 




TCA 


AAG 




Ser 
240 


Lys 



20 25 30 

CTG GAG GCG CCC AAG TGC GCG CCG GGA GTC GGG CTG GTC CGG 264 
Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg 
35 40 45 

TGC GGC TGC TGT AAG GTC TGC GCC AAG CAG CTC AAC GAG GAC 312 
Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Asn Glu Asp 
50 55 60 

AAA ACG CAG CCC TGC GAC CAC ACC AAG GGG CTG GAA TGC AAC 360 
Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn 
70 75 

GCC AGC TCC ACC GCT CTG AAG GGG ATC TGC AGA GCT CAG TCA 4 08 

Ala Ser Ser Thr Ala Leu Lys Gly lie Cys Arg Ala Gin Ser 
85 90 95 

AGA CCC TGT GAA TAT AAC TCC AGA ATC TAC CAA AAC GGG GAA 456 
Arg Pro Cys Glu Tyr Asn Ser Arg lie Tyr Gin Asn Gly Glu 
100 ' 105 110 

CAG CCC AAC TGT CAA CAT CAG TGC ACA TGT ATT GAT GGC GCC 504 
Gin Pro Asn Cys Gin His Gin Cys Thr Cys He Asp Gly Ala 
115 120 125 

TGC ATT CCT CTG TGT CCC CAA GAA CTA TCT CTC CCC AAC TTG 552 

Cys He Pro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu 

130 135 140 

CCC AAC CCT CGG CTG GTC AAA GTT ACC GGG CAG TGC TGC GAG 600 

Pro Asn Pro Arg Leu Val Lys Val Thr Gly Gin Cys Cys Glu 

150 155 

GTC TGT GAC GAG GAT AGT ATC AAG GAC CCC ATG GAG GAC CAG 648 
Val Cys Asp Glu Asp Ser He Lys Asp Pro Met Glu Asp Gin 
165 170 175 

CTC CTT GGC AAG GAG CTG GGA TTC GAT GCC TCC GAG GTG GAG 696 
Leu Leu Gly Lys Glu Leu Gly Phe Asp Ala Ser Glu Val Glu 
180 185 190 

AGA AAC AAT GAA TTG ATT GCA GTT GGA AAA GGC AGA TCA CTG 74 4 

Arg Asn Asn Glu Leu He Ala Val Gly Lys Gly Arg Ser Leu 
195 200 205 

CTC CCT GTT TTT GGA ATG GAG CCT CGC ATC CTA TAC AAC CCT 7 92 

Leu Pro Val Phe Gly Met Glu Pro Arg He Leu Tyr Asn Pro 

210 215 220 

GGC CAG AAA TGT ATT GTT CAA ACA ACT TCA TGG TCC CAG TGC 84 0 

Gly Gin Lys Cys He Val Gin Thr Thr Ser Trp Ser Gin Cys 
230 235 

ACC TGT GGA ACT GGT ATC TCC ACA CGA GTT ACC AAT GAC AAC 888 
Thr Cys Gly Thr Gly He Ser Thr Arg Val Thr Asn Asp Asn 
245 250 255 



CCT GAG TGC CGC CTT GTG AAA GAA ACC CGG ATT TGT GAG GTG CGG CCT 936 
Pro Glu Cys Arg Leu Val Lys Glu Thr Arg lie Cys Glu Val Arg Pro 
260 265 270 

TGT GGA CAG CCA GTG TAC AGC AGC CTG AAA AAG GGC AAG AAA TGC AGC 984 
Cys Gly Gin Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser 
275 280 285 

AAG ACC AAG AAA TCC CCC GAA CCA GTC AGG TTT ACT TAC GCT GGA TGT 1032 
Lys Thr Lys Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys 
290 295 300 

TTG AGT GTG AAG AAA TAC CGG CCC AAG TAC TGC GGT TCC TGC GTG GAC 1080 
Leu Ser Val Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp 
305 310 315 

GGC CGA TGC TGC ACG CCC CAG CTG ACC AGG ACT GTG AAG ATG CGG TTC 1128 
Gly Arg Cys Cys Thr Pro Gin Leu Thr Arg Thr Val Lys Met Arg Phe 
320 325 330 335 

CGC TGC GAA GAT GGG GAG ACA TTT TCC AAG AAC GTC ATG ATG ATC CAG 117 6 

Arg Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met lie Gin 
340 345 350 

TCC TGC AAA TGC AAC TAC AAC TGC CCG CAT GCC AAT GAA GCA GCG TTT 1224 
Ser Cys Lys Cys Asn Tyr Asn Cys Pro His Ala Asn Glu Ala Ala Phe 
355 ~ 360 365 

CCC TTC TAC AGG CTG TTC AAT GAC ATT CAC AAA TTT AGG GAC 1266 
Pro Phe Tyr Arg Leu Phe Asn Asp He His Lys Phe Arg Asp 
370 375 380 

TAAATGCTAC CTGGGTTTCC AGGGCACACC TAGACAAACA AGGGAGAAGA GTGTCAGAAT 132 6 

C AG AAT CAT G GAGAAAATGG GCGGGGGTGG TGTGGGTGAT GGGACTCATT GTAGAAAGGA 1386 



AGCCTTCTCA TTCTTGAGGA GCATTAAGGT AT 1418 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(D) OTHER INFORMATION: "Human Cyr61 amino acid sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Ser Ser Arg He Ala Arg Ala Leu Ala Leu Val Val Thr Leu Leu 
15 10 15 



His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His Cys 
20 25 30 

Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp 
35 40 45 

Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Asn Glu Asp Cys 
50 55 60 

Ser Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 
65 70 75 80 

Gly Ala Ser Ser Thr Ala Leu Lys Gly He Cys Arg Ala Gin Ser Glu 
85 90 95 

Gly Arg Pro Cys Glu Tyr Asn Ser Arg He Tyr Gin Asn Gly Glu Ser 
100 105 110 

Phe Gin Pro Asn Cys Gin His Gin Cys Thr Cys He Asp Gly Ala Val 
115 120 125 

Gly Cys He Pro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly 
130 135 140 

Cys Pro Asn Pro Arg Leu Val Lys Val Thr Gly Gin Cys Cys Glu Glu 
145 150 155 160 

Trp Val Cys Asp Glu Asp Ser He Lys Asp Pro Met Glu Asp Gin Asp 
165 170 175 

Gly Leu Leu Gly Lys Glu Leu Gly Phe Asp Ala Ser Glu Val Glu Leu 
180 185 190 

Thr Arg Asn Asn Glu Leu He Ala Val Gly Lys Gly Arg Ser Leu Lys 
195 200 205 

Arg Leu Pro Val Phe Gly Met Glu Pro Arg He Leu Tyr Asn Pro Leu 
210 215 220 

Gin Gly Gin Lys Cys He Val Gin Thr Thr Ser Trp Ser Gin Cys Ser 
225 1 ^ 230 235 240 

Lys Thr Cys Gly Thr Gly He Ser Thr Arg Val Thr Asn Asp Asn Pro 
245 250 255 

Glu Cys Arg Leu Val Lys Glu Thr Arg He Cys Glu Val Arg Pro Cys 
260 265 270 

Gly Gin Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys 
275 280 285 

Thr Lys Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Leu 
290 " 295 300 

Ser Val Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly 
305 ~ " 310 315 320 



Arg Cys Cys Thr Pro Gin Leu Thr Arg Thr Val Lys Met Arg Phe Arg 
325 330 335 

Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met lie Gin Ser 
340 345 350 

Cys Lys Cys Asn Tyr Asn Cys Pro His Ala Asn Glu Ala Ala Phe Pro 
355 360 365 

Phe Tyr Arg Leu Phe Asn Asp lie His Lys Phe Arg Asp 
370 ^ 375 380 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(D) OTHER INFORMATION: "Fispl2 cDNA coding sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAATTCCGCC GACAACCCCA GACGCCACCG CCTGGAGCGT CCAGACACCA ACCTCCGCCC 60 

CTGTCCGAAT CCAGGCTCCA GCCGCGCCTC TCGTCGCCTC TGCACCCTGC TGTGCATCCT 120 

CCTACCGCGT CCCGATCATG CTCGCCTCCG TCGCAGGTCC CATCAGCCTC GCCTTGGTGC 180 

TCCTCGCCCT CTGCACCCGG CCTGCTACGG GCCAGGACTG CAGCGCGCAA TGTCAGTGCG 24 0 

CAGCCGAAGC AGCGCCGCAC TGCCCCGCCG GCGTGAGCCT GGTGCTGGAC GGCTGCGGCT 300 

GCTGCCGCGT CTGCGCCAAG CAGCTGGGAG AACTGTGTAC GGAGCGTGAC CCCTGCGACC 360 

CACACAAGGG CCTCTTCTGC GATTTCGGCT CCCCCGCCAA CCGCAAGATT GGAGTGTGCA 420 

CTGCCAAAGA TGGTGCACCC TGTGTCTTCG GTGGGTCGGT GTACCGCAGC GGTGAGTCCT 4 80 

TCCAAAGCAG CTGCAAATAC CAATGCACTT GCCTGGATGG GGCCGTGGGC TGCGTGCCCC 54 0 

TATGCAGCAT GGACGTGCGC CTGCCCAGCC CTGACTGCCC CTTCCCGAGA AGGGTCAAGC 600 

TGCCTGGGAA ATGCTGCAAG GAGTGGGTGT GTGACGAGCC CAAGGACCGC ACAGCAGTTG 660 

GCCCTGCCCT AGCTGCCTAC CGACTGGAAG ACACATTTGG CCCAGACCCA ACTATGATGC 720 

GAGCCAACTG CCTGGTCCAG AC C AC AG AG T GGAGCGCCTG TTCTAAGACC TGTGGAATGG 780 

GCATCTCCAC CCGAGTTACC AATGACAATA CCTTCTGCAG ACTGGAGAAG CAGAGCCGCC 84 0 

TCTGCATGGT CAGGCCCTGC GAAGCTGACC TGGAGGAAAA CATTAAGAAG GGCAAAAAGT 900 



GCATCCGGAC 


ACCTAAAATC 


GCCAAGCCTG 


TCAAGTTTGA 


GCTTTCTGGC 


TGCACCAGTG 


960 


TGAAGACATA 


CAGGGCTAAG 


TTCTGCGGGG 


TGTGCACAGA 


CGGCCGCTGC 


TGCACACCGC 


1020 


ACAGAACCAC 


CACTCTGCCA 


GTGGAGTTCA 


AATGCCCCGA 


TGGCGAGATC 


ATGAAAAAGA 


1080 


ATATGATGTT 


CATCAAGACC 


TGTGCCTGCC 


ATTACAACTG 


TCCTGGGGAC 


AATGACATCT 


1140 


TTGAGTCCCT 


GTACTACAGG 


AAGATGTACG 


GAGACATGGC 


GTAAAGCCAG 


GAAGTAAGGG 


1200 


ACACGAACTC 


ATTAGACTAT 


AACTTGAACT 


GAGTTGCATC 


TCATTTTCTT 


CTGTAAAAAC 


1260 


AATTACAGTA 


GCACATTAAT 


TTAAATCTGT 


GTTTTTAACT 


ACCGTGGGAG 


GAACTATCCC 


1320 


ACCAAAGTGA 


GAACGTTATG 


TCATGGCCAT 


ACAAGTAGTC 


TGTCAACCTC 


AGACACTGGT 


1380 


TTCGAGACAG 


TTTACACTTG 


ACAGTTGTTC 


ATTAGCGCAC 


AGTGCCAGAA 


CGCACACTGA 


1440 


GGTGAGTCTC 


CTGGAACAGT 


GGAGATGCCA 


GGAGAAAGAA 


AGACAGGTAC 


TAGCTGAGGT 


1500 


TATTTTAAAA 


GCAGCAGTGT 


GCCTACTTTT 


TGGAGTGTAA 


CCGGGGAGGG 


AAATTATAGC 


1560 


ATGCTTGCAG 


ACAGACCTGC 


TCTAGCGAGA 


GCTGAGCATG 


TGTCCTCCAC 


TAGATGAGGC 


1620 


TGAGTCCAGC 


TGTTCTTTAA 


GAACAGCAGT 


TTCAGCCTCT 


GACCATTCTG 


ATTCCAGTGA 


1680 


CACTTGTCAG 


GAGTCAGAGC 


CTTGTCTGTT 


AGACTGGACA 


GCTTGTGGCA 


AGTAAGTTTG 


1740 


CCTGTAACAA 


GCCAGATTTT 


TAT T GAT ATT 


GTAAATATTG 


TGGATATATA 


TATATATATA 


1800 


TATATTTGTA 


CAGTTATCTA 


AGTTAATTTA 


AAGTCATTTG 


TTTTTGTTTT 


AAGTGCTTTT 


1860 


GGGATTTTAA 


ACTGATAGCC 


TCAAACTCCA 


AACACCATAG 


GTAGGACACG 


AAGCTTATCT 


1920 


GTGATTCAAA 


ACAAAGGAGA 


TACTGCAGTG 


GGAATTGTGA 


CCTGAGTGAC 


TCTCTGTCAG 


1980 


AACAAACAAA 


TGCTGTGCAG 


GTGATAAAGC 


TATGTATTGG 


AAGTCAGATT 


TCTAGTAGGA 


2040 


AATGTGGTCA 


AATCCCTGTT 


GGTGAACAAA 


TGGCCTTTAT 


TAAGAAATGG 


CTGGCTCAGG 


2100 


GTAAGGTCCG 


ATTCCTACCA 


GGAAGTGCTT 


GCTGCTTCTT 


T GAT TAT G AC 


TGGTTTGGGG 


2160 


TGGGGGGCAG 


TTTATTTGTT 


GAGAGTGTGA 


CCAAAAGTTA 


CATGTTTGCA 


CCTTTCTAGT 


2220 


TGAAAATAAA 


GTATATATAT 


ATTTTTTATA 


TGAAAAAAAA 


GGAATTC 




2267 


(2) INFORMATION FOR SEQ ID NO: 6: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE : 

(A) NAME /KEY : misc_feature 

(D) OTHER INFORMATION: "Fispl2 amino acid sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Leu Ala Ser Val Ala Gly Pro He Ser Leu Ala Leu Val Leu Leu 
15 10 15 

Ala Leu Cys Thr Arg Pro Ala Thr Gly Gin Asp Cys Ser Ala Gin Cys 
20 25 30 

Gin Cys Ala Ala Glu Ala Ala Pro His Cys Pro Ala Gly Val Ser Leu 
35 40 45 

Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu Gly 
50 " 55 60 

Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu Phe 
65 ~ 70 75 80 

Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys He Gly Val Cys Thr Ala 
85 90 95 

Lys Asp Gly Ala Pro Cys Val Phe Gly Gly Ser Val Tyr Arg Ser Gly 
100 105 HO 

Glu Ser Phe Gin Ser Ser Cys Lys Tyr Gin Cys Thr Cys Leu Asp Gly 
115 120 125 

Ala Val Gly Cys Val Pro Leu Cys Ser Met Asp Val Arg Leu Pro Ser 
130 ~ 135 140 

Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys Cys 
145 ^ 150 155 160 

Lys Glu Trp Val Cys Asp Glu Pro Lys Asp Arg Thr Ala Val Gly Pro 
165 170 175 

Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro Thr 
180 185 190 

Met Met Arg Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala Cys 
195 200 205 

Ser Lys Thr Cys Gly Met Gly He Ser Thr Arg Val Thr Asn Asp Asn 
210 215 220 

Thr Phe Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg Pro 
225 ~ 230 235 240 

Cys Glu Ala Asp Leu Glu Glu Asn He Lys Lys Gly Lys Lys Cys He 
245 250 255 



Arg Thr Pro Lys He Ala Lys Pro Val Lys Phe Glu Leu Ser Gly Cys 
260 2 65 27 0 



Thr Ser Val Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr Asp 
275 280 285 

Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu Phe 
290 295 300 

Lys Cys Pro Asp Gly Glu lie Met Lys Lys Asn Met Met Phe lie Lys 
305 ~ ~ 310 315 320 

Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp He Phe Glu 
325 330 335 

Ser Leu Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala 
340 345 

(2) INFORMATION FOR SEQ ID NO: 7: 

H 1 ; (i) SEQUENCE CHARACTERISTICS: 
O (A) LENGTH: 2075 base pairs 

G (B) TYPE: nucleic acid 

111 (C) STRANDEDNESS: single 

yd (D) TOPOLOGY: linear 

m (ii) MOLECULE TYPE: DNA 

w (ix) FEATURE: 

* (A) NAME/KEY: misc_f eature 

& (D) OTHER INFORMATION: "CTGF cDNA coding sequence" 



i ■ \ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CCCGGCCGAC AGCCCCGAGA CGACAGCCCG GCGCGTCCCG GTCCCCACCT CCGACCACCG 60 

CCAGCGCTCC AGGCCCCGCG CTCCCCGCTC GCCGCCACCG CGCCCTCCGC TCCGCCCGCA 120 

GTGCCAACCA TGACCGCCGC CAGTATGGGC CCCGTCCGCG TCGCCTTCGT GGTCCTCCTC 180 

GCCCTCTGCA GCCGGCCGGC CGTCGGCCAG AACTGCAGCG GGCCGTGCCG GTGCCCGGAC 240 

GAGCCGGCGC CGCGCTGCCC GGCGGGCGTG AGCCTCGTGC TGGACGGCTG CGGCTGCTGC 300 

CGCGTCTGCG CCAAGCAGCT GGGCGAGCTG TGCACCGAGC GCGACCCCTG CGACCCGCAC 360 

AAGGGCCTCT TCTGTGACTT CGGCTCCCCG GCCAACCGCA AGATCGGCGT GTGCACCGCC 420 

AAAGATGGTG CTCCCTGCAT CTTCGGTGGT ACGGTGTACC GCAGCGGAGA GTCCTTCCAG 4 80 

AGCAGCTGCA AGTACCAGTG CACGTGCCTG GACGGGGCGG TGGGCTGCAT GCCCCTGTGC 540 

AGCATGGACG TTCGTCTGCC CAGCCCTGAC TGCCCCTTCC CGAGGAGGGT CAAGCTGCCC 600 

GGGAAATGCT GCGAGGAGTG GGTGTGTGAC GAGCCCAAGG ACCAAACCGT GGTTGGGCCT 660 

GCCCTCGCGG CTTACCGACT GGAAGACACG TTTGGCCCAG ACCCAACTAT GATTAGAGCC 720 

AACTGCCTGG TCCAGACCAC AGAGTGGAGC GCCTGTTCCA AGACCTGTGG GATGGGCATC 7 80 



TCCACCCGGG TTACCAATGA CAACGCCTCC TGCAGGCTAG AGAAGCAGAG CCGCCTGTGC 84 0 

ATGGTCAGGC CTTGCGAAGC TGACCTGGAA GAGAACATTA AGAAGGGCAA AAAGTGCATC 900 

CGTACTCCCA AAATCTCCAA GCCTATCAAG TTTGAGCTTT CTGGCTGCAC CAGCATGAAG 960 

ACATACCGAG CTAAATTCTG TGGAGTATGT ACCGACGGCC GATGCTGCAC CCCCCACAGA 1020 

ACCACCACCC TGCCGGTGGA GTTCAAGTGC CCTGACGGCG AGGTCATGAA GAAGAACATG 1080 

ATGTTCATCA AGACCTGTGC CTGCCATTAC AACTGTCCCG GAGACAATGA CATCTTTGAA 1140 

TCGCTGTACT ACAGGAAGAT GTACGGAGAC ATGGCATGAA GCCAGAGAGT GAGAGACATT 1200 

AACTCATTAG ACTGGAACTT GAACTGATTC ACATCTCATT TTTCCGTAAA AATGATTTCA 12 60 

GTAGCACAAG TTATTTAAAT CTGTTTTTCT AACTGGGGGA AAAGATTCCC ACCCAATTCA 1320 

AAACATTGTG CCATGTCAAA CAAATAGTCT ATCTTCCCCA GACACTGGTT TGAAGAATGT 1380 

0 TAAGACTTGA CAGTGGAACT AC AT TAG T AC ACAGCACCAG AATGTATATT AAGGTGTGGC 1440 
W TTTAGGAGCA GTGGGAGGGT ACCGGCCCGG TTAGTATCAT CAGATCGACT CTTATACGAG 1500 

TAATATGCCT GCTATTTGAA GTGTAATTGA GAAGGAAAAT TTTAGCGTGC TCACTGACCT 1560 

GCCTGTAGCC CCAGTGACAG CTAGGATGTG CATTCTCCAG CCATCAAGAG ACTGAGTCAA 1620 

M GTTGTTCCTT AAGTCAGAAC AGCAGACTCA GCTCTGACAT TCTGATTCGA ATGACACTGT 1680 

fy TCAGGAATCG GAATCCTGTC GATTAGACTG GACAGCTTGT GGCAAGTGAA TTTGCCTGTA 1740 

B ACAAGCCAGA TTTTTTAAAA TTTATATTGT AAATATTGTG TGTGTGTGTG TGTGTGTATA 1800 

1 u 

TATATATATA TATGTACAGT TATCTAAGTT AATTTAAAGT TGTTTGTGCC TTTTTATTTT 18 60 

TGTTTTTAAT GCTTTGATAT TTCAATGTTA GCCTCAATTT CTGAACACCA TAGGTAGAAT 1920 

GTAAAGCTTG TCTGATCGTT CAAAGCATGA AATGGATACT TATATGGAAA TTCTGCTCAG 1980 

ATAGAATGAC AGTCCGTCAA AACAGATTGT TTGCAAAGGG GAG G CATC AG TGTCTTGGCA 2 04 0 

GGCTGATTTC TAGGTAGGAA ATGTGGTAGC TCACG 2075 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(D) OTHER INFORMATION: "CTGF amino acid sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Thr Ala Ala Ser Met Gly Pro Val Arg Val Ala Phe Val Val Leu 
1 5 10 15 

Leu Ala Leu Cys Ser Arg Pro Ala Val Gly Gin Asn Cys Ser Gly Pro 
20 25 30 

Cys Arg Cys Pro Asp Glu Pro Ala Pro Arg Cys Pro Ala Gly Val Ser 
35 40 45 

Leu Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu 
50 55 60 

Gly Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu 
65 70 75 80 

Phe Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys lie Gly Val Cys Thr 
85 90 95 

Ala Lys Asp Gly Ala Pro Cys He Phe Gly Gly Thr Val Tyr Arg Ser 
100 105 110 

Gly Glu Ser Phe Gin Ser Ser Cys Lys Tyr Gin Cys Thr Cys Leu Asp 
115 120 125 



re Gly Ala Val Gly Cys Met Pro Leu Cys Ser Met Asp Val Arg Leu Pro 

130 135 14 0 



Ser Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys 
145 L 150 155 160 

Cys Glu Glu Trp Val Cys Asp Glu Pro Lys Asp Gin Thr Val Val Gly 
165 170 175 

Pro Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro 
180 185 190 

Thr Met He Arg Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala 
195 200 205 

Cys Ser Lys Thr Cys Gly Met Gly He Ser Thr Arg Val Thr Asn Asp 
210 215 220 

Asn Ala Ser Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg 
225 230 235 240 

Pro Cys Glu Ala Asp Leu Glu Glu Asn He Lys Lys Gly Lys Lys Cys 
245 250 255 

He Arg Thr Pro Lys He Ser Lys Pro He Lys Phe Glu Leu Ser Gly 
260 2 65 27 0 



Cys Thr Ser Met Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr 
275 280 285 



Asp Gly Arg Cys Cys Thr Pro His 
290 295 

Phe Lys Cys Pro Asp Gly Glu Val 
305 ~ " 310 

Lys Thr Cys Ala Cys His Tyr Asn 
325 

Glu Ser Leu Tyr Tyr Arg Lys Met 
340 



Arg Thr Thr Thr Leu Pro Val Glu 
300 

Met Lys Lys Asn Met Met Phe lie 
315 320 

Cys Pro Gly Asp Asn Asp lie Phe 
330 335 

Tyr Gly Asp Met Ala 
345 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGGATCTGT GACGAGCCCA AGGAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGGAATTCGA CCAGGCAGTT GGCTCG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGGGATCCTG TGATGAAGAC AG CAT T 26 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



h;f (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ty GGGAATTCAA CGATGCATTT CTGGCC 26 

Iff (2) INFORMATION FOR SEQ ID NO: 13: 

III 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

G (ii) MOLECULE TYPE: peptide 

ri ?. 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Asn Glu Asp 
1 ~ l 5 ~ 10 15 

Cys Ser Lys Thr Gin 
20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Asn Cys Lys His Gin Cys Thr Cys He Asp Gly Ala Val Gly Cys 
1 " 5 10 15 

He Pro Leu Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys He Val Gin Thr Thr Ser Trp Ser Gin Cys Ser Lys Ser Cys Gly 
15 10 15 

Thr Gly He Ser Thr Arg Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

He Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys Arg Leu Val Lys 
15 10 15 

Glu Thr Arg He Cys Glu Val Arg Pro Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 



(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys Cys Thr Pro Leu Gin 
15 10 15 

Thr Arg Thr Val Lys 
20 



